SEQUENCE LISTING 



<110> Chappie, Clinton 
Franke , Rochus 
Ruegger, Maxwell 

<120> Genes Encoding P-Coumarate 3-Hydroxylase (C3H) and Methods of Use • 

<130> N1422-005 

<150> US 60/225554 
<151> 2000-08-16 

<160> 7 

<170> Patentln version 3.0 

<210> 1 
<211> 1527 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 1 

atgtcgtggt ttctaatagc ggtggcgaca atcgccgccg tcgtatccta caagctaatc 60 
caacggctaa gatacaagtt cccaccaggc ccaagcccca agccgatcgt cggtaacctc 12 0 
tacgacataa aaccggtccg gttcagatgt tactacgagt gggctcaatc ttatggacca 180 
atcatatcgg tctggatcgg ttcaattcta aacgtggtcg tatctagcgc cgagctagca 240 
aaagaagttc tgaaagaaca cgaccagaaa ctcgccgacc ggcaccggaa cagatcgacg 3 00 





gaagcattta 


gccgcaacgg 


tcaggatctt 




aaggtgagaa 


aagtttgcac 


gcttgagctc 




cctatccgtg 


aagatgaagt 


caccgccatg 




cctgaaaaca 


gagcaaaagg 


tttacaactg 




aacataacgc 


ggctagcctt 


tgggaagcgt 




caagggcttg 


agttcaaggc 


catagtatcc 




atagctgaac 


acatcccgtg 


gctcaggtgg 




gagcacgggg 


ctcgtcgtga 


ccgcctcact 




cgtcaaaagt 


ctagtggagc 


gaaacagcat 




cagtatgatc 


ttagtgagga 


tactatcatt 


i ' — 1 

it' 


atggacacga 


cagcgataac 


agcggaatgg 


w 


gtgcaacaaa 


aagtgcaaga 


agagttcgac 


: is = 


gaggcagatt 


tctcccgctt 


accttacttg 




catcctccaa 


cgcctctaat 


gctacctcac 




tatgatattc 


ccaaaggatc 


aaacgttcat 


!□ 


gctgtatgga 


aaaatccatt 


tgagtttaga 


j sis 
: e s 






gctycccccy 


; ( _s_ 


gcacaacttg 


gtatcaattt 


ggtaacttcg 


; . — , 


tggacacctc 


ctcaagggac 


taaaccggag 




gttacttaca 


tgcgtacccc 


tgtgcaagcg 




tacaaacgcg 


tgccttacga 


tatgtaa 



<210> 2 

<211> 1527 

<212> DNA 

<213> Arabidopsis thaliana 



atatgggccg attatgggcc tcattacgtg 360 

ttcacaccga aacgactcga gtctctcaga 420 

gttgaatccg tcttcagaga ctgtaacctt 480 

aggaagtact taggagcggt tgcgttcaac 540 

tttatgaacg ctgaaggtgt tgtggacgag 600 

aacggtctga agctaggtgc ttcactgtca 660 

atgtttccgg ctgatgagaa ggcgtttgct 720 

cgagctatca tggaggagca tactttggcc 780 

ttcgttgatg cgttgctaac gttgaaggat 84 0 

ggtcttctat gggatatgat cacggcaggg 900 

gcgatggcgg aaatgatcaa gaatccaaga 960 

agagtggttg gacttgaccg gatcttaacc 1020 

caatgcgtgg tgaaagagtc attcaggctg 1080 

cgaagcaacg cagatgtcaa gatcggaggc 1140 

gtgaatgtgt gggctgtggc tagagacccg 1200 

ccagagagat tcttggaaga agatgttgac 12 60 

tttggagctg gaagacgggt ttgtcccggt 132 0 

atgatgagtc atttgcttca ccattttgtt 1380 

gagattgaca tgtctgaaaa ccctggactc 144 0 

gttgcaacgc ctcggttgcc ttcggatctg 1500 

1527 



<400> 2 

atgtcgtggt ttctaatagc ggtggcgaca atcgccgccg tcgtatccta caagctaatc 



60 



caacggctaa gatacaagtt cccaccaggc 
tacgacataa aaccggtccg gttcagatgt 
atcatatcgg tctggatcgg ttcaattcta 
aaagaagttc tgaaagaaca cgaccagaaa 
gaagcattta gccgcaacgg tcaggatctt 
aaggtgagaa aagtttgcac gcttgagctc 
cctatccgtg aagatgaagt caccgccatg 
cctgaaaaca gagcaaaagg tttacaactg 
aacataacgc ggctagcctt tgggaagcgt 
caagggcttg agttcaaggc catagtatcc 
.f% atagctgaac acatcccgtg gctcaggtgg 

* Est 

*C 

gagcacgggg ctcgtcgtga ccgcctcact 

\& 

Ij 3 * cgtcaaaagt ctagtggagc gaaacagcat 

i'U 

|f] cagtatgatc ttagtgagga tactatcatt 

, atggacacga cagcgataac agcggaatgg 

P 

m gtgcaacaaa aagtgcaaga agagttcgac 

*Z gaggcagatt tctcccgctt accttacttg 

Ij] 

13 catcctccaa cgcctctaat gctacctcac 
tatgatattc ccaaaggatc aaacgttcat 
gctgtatgga aaaatccatt tgagtttaga 
atgaagggtc atgattttag gctgcttccg 
gcacaacttg atatcaattt ggtaacttcg 
tggacacctc ctcaagggac taaaccggag 
gttacttaca tgcgtacccc tgtgcaagcg 
tacaaacgcg tgccttacga tatgtaa 



3 



ccaagcccca 


agccgatcgt 


cggtaacctc 


120 


tactacgagt 


gggctcaatc 


ttatggacca 


180 


aacgtggtcg 


tatctagcgc 


cgagctagca 


240 


ctcgccgacc 


ggcaccggaa 


cagatcgacg 


300 


atatgggccg 


attatgggcc 


tcattacgtg 


360 


ttcacaccga 


aacgactcga 


gtctctcaga 


420 


gttgaatccg 


tcttcagaga 


ctgtaacctt 


480 


aggaagtact 


taggagcggt 


tgcgttcaac 


540 


tttatgaacg 


ctgaaggtgt 


tgtggacgag 


600 


aacggtctga 


agctaggtgc 


ttcactgtca 


660 


atgtttccgg 


ctgatgagaa 


ggcgtttgct 


720 


cgagctatca 


tggaggagca 


tactttggcc 


780 


ttcgttgatg 


cgttgctaac 


gttgaaggat 


840 


ggtcttctat 


gggatatgat 


cacggcaggg 


900 


gcgatggcgg 


aaatgatcaa 


gaatccaaga 


960 


agagtggttg 


gacttgaccg 


gatcttaacc 


1020 


caatgcgtgg 


tgaaagagtc 


attcaggctg 


1080 


cgaagcaacg 


cagatgtcaa 


gatcggaggc 


1140 


gtgaatgtgt 


gggctgtggc 


tagagacccg 


1200 


ccagagagat 


tcttggaaga 


agatgttgac 


1260 


tttggagctg 


gaagacgggt 


ttgtcccggt 


1320 


atgatgagtc 


atttgcttca 


ccattttgtt 


1380 


gagattgaca 


tgtctgaaaa 


ccctggactc 


1440 


gttgcaacgc 


ctcggttgcc 


ttcggatctg 


1500 



1527 



<210> 3 
<211> 154 



9 



4 



<212> DNA 

<213> Arabidopsis thaliana 





<400> 3 
gcaaggatcc 


atgtcgtggt 


ttctaatagc 


ggtggcgaca 


atcgccgccg 


tcgtatccta 


60 




caagctaatc 


caacggctaa 


gatacaagtt 


cccaccaggc 


ccaagcccca 


agccgatcgt 


120 




cggtaacctc 


tacgacataa 


aaccggtccg 


gttcagatgt 


tactacgagt 


gggctcaatc 


180 




ttatggacca 


atcatatcgg 


tctggatcgg 


ttcaattcta 


aacgtggtcg 

ZJ ZJ ZJ ZJ 


tatctagcgc 


240 




cgagctagca 


aaagaagttc 


tgaaagaaca 


cgaccagaaa 


ctcgccgacc 


ggcaccggaa 


300 




cagatcgacg 


gaagcattta 


gccgcaacgg 


tcaggatctt 


atatqqqccq 


attatgggcc 


360 




tcattacgtg 


aaggtgagaa 


aagtttgcac 


gcttgagctc 


ttcacaccga 


aacgactcga 


420 


^0 


gtctctcaga 


cctatccgtg 


aagatgaagt 


caccgccatg 


qttqaatccq 


tcttcagaga 


480 




ctgtaacctt 


cctgaaaaca 


gagcaaaagg 


tttacaactg 


aggaagtact 


taqqaqcqqt 


540 


jy 


tgcgttcaac 


aacataacgc 


ggctagcctt 


tgggaaqcqt 

ZJZJZJ ZJ J 


tttatgaacg 


ctqaaqqtqt 


600 




tgtggacgag 


caagggcttg 

ZJ ZJ Z3 ZJ 


agttcaaggc 


catagtatcc 


aacqqtctaa 


agctaggtgc 


660 


? tSST 


ttcactgtca 


atagctgaac 


acatcccgtg 


qctcaqqtqq 


atgtttccgg 


ctgatgagaa 


720 


{9 


ggcgtttgct 


gagcacgggg 


ctcgtcgtga 


ccgcctcact 


cgagctatca 


tqqaqqaQca 


780 


!ii 

t tssr 


tactttggcc 


cgtcaaaagt 


ctagtggagc 

mJ — » Z* ZJ 


gaaacagcat 


ttcgttgatg 


cgttgctaac 


840 




gttgaaggat 


cagtatgatc 


ttagtgagga 


tactatcatt 


ggtcttctat 


gggatatqat 

ZJZJZJ ZJ 


900 




cacggcaggg 


atggacacga 


cagcgataac 


agcggaatgg 


gcgatggcgg 


aaatgatcaa 


960 




gaatccaaga 


gtgcaacaaa 


aagtgcaaga 


agagttcgac 


agagtggttg 


gacttgaccg 


1020 




gatcttaacc 


gaggcagatt 


tctcccgctt 


accttacttg 


caatgcgtgg 


tgaaagagtc 


1080 




attcaggctg 


catcctccaa 


cgcctctaat 


gctacctcac 


cgaagcaacg 


cagatgtcaa 


1140 




gatcggaggc 


tatgatattc 


ccaaaggatc 


aaacgttcat 


gtgaatgtgt 


gggctgtggc 


1200 




tagagacccg 


gctgtatgga 


aaaatccatt 


tgagtttaga 


ccagagagat 


tcttggaaga 


1260 




agatgttgac 


atgaagggtc 


atgattttag 


gctgcttccg 


tttggagctg 


gaagacgggt 


1320 




ttgtcccggt 


gcacaacttg 


gtatcaattt 


ggtaacttcg 


atgatgagtc 


atttgcttca 


1380 




ccattttgtt 


tggacacctc 


ctcaagggac 


taaaccggag 


gagattgaca 


tgtctgaaaa 


1440 



i 



5 

ccctggactc gttacttaca tgcgtacccc tgtgcaagcg gttgcaacgc ctcggttgcc 1500 
ttcggatctg tacaaacgcg tgccttacga tatgtaaatg aattcctga 1549 

<210> 4 
<211> 508 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 4 

Met Ser Trp Phe Leu lie Ala Val Ala Thr lie Ala Ala Val Val Ser 
15 10 15 

□ 

Tyr Lys Leu lie Gin Arg Leu Arg Tyr Lys Phe Pro Pro Gly Pro Ser 
20 25 30 

Pro Lys Pro lie Val Gly Asn Leu Tyr Asp lie Lys Pro Val Arg Phe 
^ 35 40 45 



*l5 



in 



Arg Cys Tyr Tyr Glu Trp Ala Gin Ser Tyr Gly Pro lie lie Ser Val 
50 55 60 

Trp He Gly Ser He Leu Asn Val Val Val Ser Ser Ala Glu Leu Ala 
65 70 75 80 

Lys Glu Val Leu Lys Glu His Asp Gin Lys Leu Ala Asp Arg His Arg 
85 90 95 

Asn Arg Ser Thr Glu Ala Phe Ser Arg Asn Gly Gin Asp Leu He Trp 
100 105 110 

Ala Asp Tyr Gly Pro His Tyr Val Lys Val Arg Lys Val Cys Thr Leu 
115 120 125 

Glu Leu Phe Thr Pro Lys Arg Leu Glu Ser Leu Arg Pro He Arg Glu 
130 135 140 

Asp Glu Val Thr Ala Met Val Glu Ser Val Phe Arg Asp Cys Asn Leu 
145 150 155 160 

Pro Glu Asn Arg Ala Lys Gly Leu Gin Leu Arg Lys Tyr Leu Gly Ala 
165 170 175 

Val Ala Phe Asn Asn He Thr Arg Leu Ala Phe Gly Lys Arg Phe Met 
180 185 190 



Asn Ala Glu Gly Val Val Asp Glu Gin Gly Leu Glu Phe Lys Ala He 
195 200 205 



• 



in 
in 



3 

!b=F 

m 



Val Ser Asn Gly Leu Lys Leu Gly Ala Ser Leu Ser lie Ala Glu His 
210 215 220 

lie Pro Trp Leu Arg Trp Met Phe Pro Ala Asp Glu Lys Ala Phe Ala 
225 230 235 240 

Glu His Gly Ala Arg Arg Asp Arg Leu Thr Arg Ala lie Met Glu Glu 
245 250 255 

His Thr Leu Ala Arg Gin Lys Ser Ser Gly Ala Lys Gin His Phe Val 
260 265 270 

Asp Ala Leu Leu Thr Leu Lys Asp Gin Tyr Asp Leu Ser Glu Asp Thr 
275 280 285 

lie lie Gly Leu Leu Trp Asp Met lie Thr Ala Gly Met Asp Thr Thr 
290 295 300 

Ala lie Thr Ala Glu Trp Ala Met Ala Glu Met lie Lys Asn Pro Arg 
305 310 315 320 

Val Gin Gin Lys Val Gin Glu Glu Phe Asp Arg Val Val Gly Leu Asp 
325 330 335 

Arg lie Leu Thr Glu Ala Asp Phe Ser Arg Leu Pro Tyr Leu Gin Cys 
340 345 350 

Val Val Lys Glu Ser Phe Arg Leu His Pro Pro Thr Pro Leu Met Leu 
355 360 365 

Pro His Arg Ser Asn Ala Asp Val Lys lie Gly Gly Tyr Asp lie Pro 
370 375 380 



Lys Gly Ser Asn Val His Val Asn Val Trp Ala Val Ala Arg Asp Pro 
'■3 385 390 395 400 

Ala Val Trp Lys Asn Pro Phe Glu Phe Arg Pro Glu Arg Phe Leu Glu 
405 410 415 

Glu Asp Val Asp Met Lys Gly His Asp Phe Arg Leu Leu Pro Phe Gly 
420 425 430 

Ala Gly Arg Arg Val Cys Pro Gly Ala Gin Leu Gly lie Asn Leu Val 
435 440 445 

Thr Ser Met Met Ser His Leu Leu His His Phe Val Trp Thr Pro Pro 
450 455 460 

Gin Gly Thr Lys Pro Glu Glu lie Asp Met Ser Glu Asn Pro Gly Leu 
465 470 475 480 

Val Thr Tyr Met Arg Thr Pro Val Gin Ala Val Ala Thr Pro Arg Leu 
485 490 495 



Pro Ser Asp Leu Tyr Lys Arg Val Pro Tyr Asp Met 
500 505 

<210> 5 
<211> 508 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 5 



Met Ser Trp Phe 
1 

Tyr Lys Leu He 
20 

Pro Lys Pro He 
35 

Arg Cys Tyr Tyr 
50 

Trp He Gly Ser 
65 

Lys Glu Val Leu 



Leu He Ala Val 
5 

Gin Arg Leu Arg 



Val Gly Asn Leu 
40 

Glu Trp Ala Gin 
55 

He Leu Asn Val 
70 

Lys Glu His Asp 
85 



Ala Thr He Ala 
10 

Tyr Lys Phe Pro 
25 

Tyr Asp He Lys 



Ser Tyr Gly Pro 
60 

Val Val Ser Ser 
75 

Gin Lys Leu Ala 
90 



Ala Val Val Ser 
15 

Pro Gly Pro Ser 
30 

Pro Val Arg Phe 
45 

He He Ser Val 



Ala Glu Leu Ala 
80 

Asp Arg His Arg 
95 



Asn Arg Ser Thr 
100 

Ala Asp Tyr Gly 
115 

Glu Leu Phe Thr 
130 

Asp Glu Val Thr 
145 

Pro Glu Asn Arg 



Val Ala Phe Asn 
180 

Asn Ala Glu Gly 
195 



Glu Ala Phe Ser 



Pro His Tyr Val 
120 

Pro Lys Arg Leu 
135 

Ala Met Val Glu 
150 

Ala Lys Gly Leu 
165 

Asn He Thr Arg 



Val Val Asp Glu 
200 



Arg Asn Gly Gin 
105 

Lys Val Arg Lys 



Glu Ser Leu Arg 
140 

Ser Val Phe Arg 
155 

Gin Leu Arg Lys 
170 

Leu Ala Phe Gly 
185 

Gin Gly Leu Glu 



Asp Leu He Trp 
110 

Val Cys Thr Leu 
125 

Pro He Arg Glu 



Asp Cys Asn Leu 
160 

Tyr Leu Gly Ala 
175 

Lys Arg Phe Met 
190 

Phe Lys Ala He 
205 



8 



Val Ser Asn Gly Leu Lys Leu Gly Ala Ser Leu Ser lie Ala Glu His 
210 215 220 

lie Pro Trp Leu Arg Trp Met Phe Pro Ala Asp Glu Lys Ala Phe Ala 
225 230 235 240 

Glu His Gly Ala Arg Arg Asp Arg Leu Thr Arg Ala lie Met Glu Glu 
245 250 255 

His Thr Leu Ala Arg Gin Lys Ser Ser Gly Ala Lys Gin His Phe Val 
260 265 270 

Asp Ala Leu Leu Thr Leu Lys Asp Gin Tyr Asp Leu Ser Glu Asp Thr 
275 280 285 

lie lie Gly Leu Leu Trp Asp Met lie Thr Ala Gly Met Asp Thr Thr 
290 295 300 

Ala lie Thr Ala Glu Trp Ala Met Ala Glu Met lie Lys Asn Pro Arg 
305 310 315 320 

Val Gin Gin Lys Val Gin Glu Glu Phe Asp Arg Val Val Gly Leu Asp 
325 330 335 

Arg lie Leu Thr Glu Ala Asp Phe Ser Arg Leu Pro Tyr Leu Gin Cys 
340 345 350 

Val Val Lys Glu Ser Phe Arg Leu His Pro Pro Thr Pro Leu Met Leu 
355 360 365 

Pro His Arg Ser Asn Ala Asp Val Lys lie Gly Gly Tyr Asp lie Pro 
370 375 380 

Lys Gly Ser Asn Val His Val Asn Val Trp Ala Val Ala Arg Asp Pro 
385 390 395 400 

Ala Val Trp Lys Asn Pro Phe Glu Phe Arg Pro Glu Arg Phe Leu Glu 
405 410 415 

Glu Asp Val Asp Met Lys Gly His Asp Phe Arg Leu Leu Pro Phe Gly 
420 425 430 

Ala Gly Arg Arg Val Cys Pro Gly Ala Gin Leu Asp lie Asn Leu Val 
435 440 445 

Thr Ser Met Met Ser His Leu Leu His His Phe Val Trp Thr Pro Pro 
450 455 460 

Gin Gly Thr Lys Pro Glu Glu lie Asp Met Ser Glu Asn Pro Gly Leu 
465 470 475 480 



Val Thr Tyr Met Arg Thr Pro Val Gin Ala Val Ala Thr Pro Arg Leu 
485 490 495 



# 



i5% 



Pro Ser Asp Leu Tyr Lys Arg Val Pro Tyr Asp Met 
500 505 

<210> 6 
<211> 477 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 6 

Ser Pro Lys Pro lie Val Gly Asn Leu Tyr Asp He Lys Pro Val Arg 
15 10 15 

Phe Arg Cys Tyr Tyr Glu Trp Ala Gin Ser Tyr Gly Pro He He Ser 
20 25 30 

Val Trp He Gly Ser He Leu Asn Val Val Val Ser Ser Ala Glu Leu 
35 40 45 

Ala Lys Glu Val Leu Lys Glu His Asp Gin Lys Leu Ala Asp Arg His 
50 55 60 



i Arg Asn Arg Ser Thr Glu Ala Phe Ser Arg Asn Gly Gin Asp Leu He 

13 65 70 75 80 

i'n 

1^ Trp Ala Asp Tyr Gly Pro His Tyr Val Lys Val Arg Lys Val Cys Thr 
jp 85 90 95 

J* Leu Glu Leu Phe Thr Pro Lys Arg Leu Glu Ser Leu Arg Pro He Arg 
3 "* 100 105 110 

Glu Asp Glu Val Thr Ala Met Val Glu Ser Val Phe Arg Asp Cys Asn 
115 120 125 

Leu Pro Glu Asn Arg Ala Lys Gly Leu Gin Leu Arg Lys Tyr Leu Gly 
130 135 140 

Ala Val Ala Phe Asn Asn He Thr Arg Leu Ala Phe Gly Lys Arg Phe 
145 150 155 160 

Met Asn Ala Glu Gly Val Val Asp Glu Gin Gly Leu Glu Phe Lys Ala 
165 170 175 

He Val Ser Asn Gly Leu Lys Leu Gly Ala Ser Leu Ser He Ala Glu 
180 185 190 



His He Pro Trp Leu Arg Trp Met Phe Pro Ala Asp Glu Lys Ala Phe 
195 200 205 



10 



Ala Glu His Gly Ala Arg Arg Asp Arg Leu Thr Arg Ala lie Met Glu 
210 215 220 

Glu His Thr Leu Ala Arg Gin Lys Ser Ser Gly Ala Lys Gin His Phe 
225 230 235 240 

Val Asp Ala Leu Leu Thr Leu Lys Asp Gin Tyr Asp Leu Ser Glu Asp 
245 250 255 

Thr lie lie Gly Leu Leu Trp Asp Met lie Thr Ala Gly Met Asp Thr 
260 265 270 

Thr Ala lie Thr Ala Glu Trp Ala Met Ala Glu Met He Lys Asn Pro 
275 280 285 

Arg Val Gin Gin Lys Val Gin Glu Glu Phe Asp Arg Val Val Gly Leu 
290 295 300 

Asp Arg He Leu Thr Glu Ala Asp Phe Ser Arg Leu Pro Tyr Leu Gin 
305 310 315 320 

Cys Val Val Lys Glu Ser Phe Arg Leu His Pro Pro Thr Pro Leu Met 
325 330 335 

Leu Pro His Arg Ser Asn Ala Asp Val Lys He Gly Gly Tyr Asp He 
340 345 350 

Pro Lys Gly Ser Asn Val His Val Asn Val Trp Ala Val Ala Arg Asp 
355 360 365 

Pro Ala Val Trp Lys Asn Pro Phe Glu Phe Arg Pro Glu Arg Phe Leu 
370 375 380 

Glu Glu Asp Val Asp Met Lys Gly His Asp Phe Arg Leu Leu Pro Phe 
385 390 395 400 

Gly Ala Gly Arg Arg Val Cys Pro Gly Ala Gin Leu Gly He Asn Leu 
405 410 415 

Val Thr Ser Met Met Ser His Leu Leu His His Phe Val Trp Thr Pro 
420 425 430 

Pro Gin Gly Thr Lys Pro Glu Glu He Asp Met Ser Glu Asn Pro Gly 
435 440 445 

Leu Val Thr Tyr Met Arg Thr Pro Val Gin Ala Val Ala Thr Pro Arg 
450 455 460 

Leu Pro Ser Asp Leu Tyr Lys Arg Val Pro Tyr Asp Met 
465 470 475 

<210> 7 



<211> 477 



<212> PRT 

<213> Arabidopsis thaliana 



<400> 7 

Ser Pro Lys Pro lie Val Gly Asn Leu Tyr Asp lie Lys Pro Val Arg 
15 10 15 

Phe Arg Cys Tyr Tyr Glu Trp Ala Gin Ser Tyr Gly Pro lie lie Ser 
20 -25 30 

Val Trp lie Gly Ser lie Leu Asn Val Val Val Ser Ser Ala Glu Leu 
35 40 45 

Ala Lys Glu Val Leu Lys Glu His Asp Gin Lys Leu Ala Asp Arg His 
50 55 60 

Arg Asn Arg Ser Thr Glu Ala Phe Ser Arg Asn Gly Gin Asp Leu lie 
65 70 75 80 

Trp Ala Asp Tyr Gly Pro His Tyr Val Lys Val Arg Lys Val Cys Thr 
85 90 95 

Leu Glu Leu Phe Thr Pro Lys Arg Leu Glu Ser Leu Arg Pro lie Arg 
100 105 110 

Glu Asp Glu Val Thr Ala Met Val Glu Ser Val Phe Arg Asp Cys Asn 
115 120 125 

Leu Pro Glu Asn Arg Ala Lys Gly Leu Gin Leu Arg Lys Tyr Leu Gly 
130 135 140 

Ala Val Ala Phe Asn Asn lie Thr Arg Leu Ala Phe Gly Lys Arg Phe 
145 150 155 160 

Met Asn Ala Glu Gly Val Val Asp Glu Gin Gly Leu Glu Phe Lys Ala 
165 170 175 

lie Val Ser Asn Gly Leu Lys Leu Gly Ala Ser Leu Ser lie Ala Glu 
180 185 190 

His lie Pro Trp Leu Arg Trp Met Phe Pro Ala Asp Glu Lys Ala Phe 
195 200 205 

Ala Glu His Gly Ala Arg Arg Asp Arg Leu Thr Arg Ala lie Met Glu 
210 215 220 

Glu His Thr Leu Ala Arg Gin Lys Ser Ser Gly Ala Lys Gin His Phe 
225 230 235 240 



Val Asp Ala Leu Leu Thr Leu Lys Asp Gin Tyr Asp Leu Ser Glu Asp 
245 250 255 
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Thr He 



Thr Ala 



Arg Val 
290 



He Gly 
260 

He Thr 
275 

Gin Gin 



Leu Leu Trp 
Ala Glu Trp 



Lys Val Gin 
295 



Asp Met 
265 

Ala Met 
280 

Glu Glu 



He Thr Ala 



Ala Glu Met 



Phe Asp Arg 
300 



Gly Met Asp Thr 
270 

He Lys Asn Pro 
285 

Val Val Gly Leu 



Asp Arg 
305 



He Leu 



Cys Val Val Lys 



Leu Pro 



Pro Lys 



Pro Ala 
370 



His Arg 
340 

Gly Ser 
355 

Val Trp 



Thr Glu Ala 
310 

Glu Ser Phe 
325 

Ser Asn Ala 



Asn Val His 



Lys Asn Pro 
375 



Asp Phe 
Arg Leu 



Asp Val 
345 

Val Asn 
360 

Phe Glu 



Ser Arg Leu 
315 

His Pro Pro 
330 

Lys He Gly 
Val Trp Ala 



Phe Arg Pro 
380 



Pro Tyr Leu Gin 
320 

Thr Pro Leu Met 
335 

Gly Tyr Asp He 
350 

Val Ala Arg Asp 
3 65 

Glu Arg Phe Leu 



Glu Glu 
385 

Gly Ala 
Val Thr 



Asp Val 
Gly Arg 



Ser Met 
420 



Asp Met Lys 
390 

Arg Val Cys 
405 

Met Ser His 



Gly His 
Pro Gly 



Leu Leu 
425 



Asp Phe Arg 
395 

Ala Gin Leu 
410 

His His Phe 



Leu Leu Pro Phe 
400 

Asp He Asn Leu 
415 

Val Trp Thr Pro 
430 



Pro Gin 



Gly Thr 
435 



Lys Pro Glu 



Glu He 
440 



Asp Met Ser 



Glu Asn Pro Gly 
445 



Leu Val 
450 



Thr Tyr 



Met Arg Thr 
455 



Pro Val 



Gin Ala Val 
460 



Ala Thr Pro Arg 



Leu Pro Ser Asp Leu Tyr Lys Arg Val Pro Tyr Asp Met 
465 470 475 



